NEW METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO SURVIVAL ANALYSIS AND STATISTICAL REDUNDANCY ANALYSIS USING GENE EXPRESSION DATA by
نویسندگان
چکیده
by Simin Hu An important application of microarray research is to develop cancer diagnostic and prognostic tools based on tumor genetic profiles. For easy interpretation, such studies aim to identify a small fraction of genes to build molecular predictors of clinical outcomes from at least thousands of genes thus require methodologies that can model high dimensional covariates and accomplish variable selection simultaneously. One interesting area is modeling cancer patients’ survival time or time to cancer reoccurrence with gene expression data. In the first part of this dissertation, we propose a new penalized weighted least squares method for model estimation and variable selection in accelerated failure time models. In this method, right censored observations are used as censoring constraints in optimizing the weighted least squares objective function. We also include ridge penalty to deal with singularity caused by collinearity and high dimensionality and use the least absolute shrinkage and selection operator to achieve
منابع مشابه
Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...
متن کاملشناسایی ژنهای مرتبط با بقا در سرطان کلیه با استفاده از روش مؤلفههای اصلی لاسو
Background: Identification of correlated genes with survival by gene expression data is an important application of microarray data. The purpose of this study is to identify correlated genes with survival of conventional renal cell carcinoma (cRCC) patients based on gene expression profiles. Methods: This study is a survival analysis with high dimensional covariates and containing 14814 gene...
متن کاملEffect of Rosemary Essential Oil on BIM Apoptotic Gene Expression in MCF 7 Breast Cancer Cell Line
Introduction: Breast cancer is the most common cancer among women and its treatment is associated with many side effects. Herbal medicine has fewer side effects than chemical drugs, so they are especially important in the treatment of many diseases. Rosemary plant has anti-cancer effects due to its antioxidant properties. The aim of this study was to evaluate the effect of Rosemary essential oi...
متن کاملSample size determination for logistic regression
The problem of sample size estimation is important in medical applications, especially in cases of expensive measurements of immune biomarkers. This paper describes the problem of logistic regression analysis with the sample size determination algorithms, namely the methods of univariate statistics, logistics regression, cross-validation and Bayesian inference. The authors, treating the regr...
متن کاملPredicting survival from microarray data - a comparative study
MOTIVATION Survival prediction from gene expression data and other high-dimensional genomic data has been subject to much research during the last years. These kinds of data are associated with the methodological problem of having many more gene expression values than individuals. In addition, the responses are censored survival times. Most of the proposed methods handle this by using Cox's pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006